Efficient Binarization for Historical Document Analysis

نویسندگان

  • Florian Westphal
  • Håkan Grahn
  • Niklas Lavesson
چکیده

Readability of document images is one core issue when analysing historical documents. One way to improve the readability of those document images is image binarization. By separating the written text from its background, documents degraded by, e.g., stains or faded ink become better readable. Due to the large quantity of available historical document images, this binarization needs to be done efficiently to make this form of processing feasible. In this paper, we present our work in progress on improving the execution performance of a state-of-the-art binarization algorithm by mapping it onto a heterogenous platform. We describe how the algorithm can be divided and computed in parallel on CPU and GPU. The preliminary results, which we report in this paper, suggest that a speedup of 1.4 is possible in comparison to the original algorithm.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Enhancement of Images Using Recursive Adaptive Gamma Correction

The “Adaptive Approach for Historical or Degraded Document Binarization” is that in which Libraries and Museums obtain in large gathering of ancient historical documents printed or handwritten in native languages. Typically, only a small group of people are allowed access to such collection, as the preservation of the material is of great concern. In recent years, libraries have begun to digiti...

متن کامل

Restoration of Degraded Historical Document Image: An Adaptive Multilayer-Information Binarization Technique

Binary image is the essential format for document image processing, and the operation of the subsequent steps depends on the quality of the binarization process. The objective of this research is to propose a new binarization method based on adaptive multilayer-information for restoration of degraded historical document images. This paper focuses on degraded Thai historical document images, whi...

متن کامل

An Adaptive Binarization Technique for Low Quality Historical Documents

Historical document collections are a valuable resource for human history. This paper proposes a novel digital image binarization scheme for low quality historical documents allowing further content exploitation in an efficient way. The proposed scheme consists of five distinct steps: a pre-processing procedure using a low-pass Wiener filter, a rough estimation of foreground regions using Nibla...

متن کامل

A Proposed Binarization Technique on Hand written document

Abstract: Binarization is performed in the preprocessing stage for document inspection. Binarization of degraded document images improve the result from poor quality of the paper, the printing process, ink blot and fading document and remove noise from examine. In recent years, libraries have begun to digitize historical document that are of interest to a wide range of people, with the goal of ...

متن کامل

A Performance Evaluation Methodology for Historical Document Image Binarization

Document image binarization is of great importance in the document image analysis and recognition pipeline since it affects further stages of the recognition process. The evaluation of a binarization method aids in studying its algorithmic behaviour and verifying its effectiveness by providing qualitative and quantitative indication of its performance. This work concerns a pixel-based binarizat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016